Maximum Entropy Modeling in Semantic Tagging

نویسندگان

  • Jia Cui
  • Frederick Jelinek
چکیده

The Maximum Entropy maxent principle has been successfully applied in classi cation and tagging tasks Compared with other statistical learning method it allows convenient integration of di erent knowledge sources However it is restricted by the size of the training corpus in that not all knowledge can be incorporated For instance events observed with low counts can be either a particular pattern or just a uke In our work human annotated data is limited but a huge amount of raw data is available and free Moreover extra information can be obtained from WordNet and dictionaries Our proposal will exploit these extra knowledge sources and raw auxiliary data in building reliable maxent models

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum Entropy Modeling in Sparse Semantic Tagging

In this work, we are concerned with a coarse grained semantic analysis over sparse data, which labels all nouns with a set of semantic categories. To get the benefit of unlabeled data, we propose a bootstrapping framework with Maximum Entropy modeling (MaxEnt) as the statistical learning component. During the iterative tagging process, unlabeled data is used not only for better statistical esti...

متن کامل

Reduction of Maximum Entropy Models to Hidden Markov Models

Maximum Entropy (maxent) models are an attractive formalism for statistical models of many types and have been used for a number of purposes, including language modeling (Rosenfeld 1994), part of speech tagging (Ratnaparkhi 1996), prepositional phrase attachment (Ratnaparkhi 1998), sentence breaking (Reynar and Ratnaparkhi 1997) and parsing (Ratnaparkhi 1997). Maxent models allow the combinatio...

متن کامل

Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity Tagging

This paper treats nominal entity tagging as a six-way (five categories plus nonentity) classification problem and applies a smoothing maximum entropy (ME) model with a Gaussian prior to the Chinese nominal entity tagging task. The experimental results show that the model performs consistently better than a ME model using a simple counting cut-off. The results also suggest that simple semantic f...

متن کامل

A Maximum Entropy Approach to FrameNet Tagging

The development of FrameNet, a large database of semantically annotated sentences, has primed research into statistical methods for semantic tagging. We advance previous work by adopting a Maximum Entropy approach and by using Viterbi search to find the highest probability tag sequence for a given sentence. Further we examine the use of syntactic pattern based re-ranking to further increase per...

متن کامل

Maximum entropy modeling for diacritization of Arabic text

We propose a novel modeling framework for automatic diacritization of Arabic text. The framework is based on Markov modeling where each grapheme is modeled as a state emitting a diacritic (or none) from the diacritic space. This space is exactly defined using 13 diacritics and a null-diacritic and covers all the diacritics used in any Arabic text. The state emission probabilities are estimated ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003